Skip to content

Conversation

@lee1043
Copy link
Contributor

@lee1043 lee1043 commented Oct 21, 2025

Fix misrepresented data range from grouped violin plot.

In the parallel coordinate plot, grouped violin plot uses seaborn.violinplot and by its default it extends the density distribution to past extreme datapoints by 2 bandwidth (cuts=2 parameter used by default). So in case like data value range is 0-5, then the violin distribution is shown for ~ -2 to 7.

Example with dummy data [0, 1, 2, 3, 4, 5], with default setting (cuts=2)
output2

The violin range can be limited to the actual value range by using parameter cuts=0, as in the second attached.

Example with dummy data [0, 1, 2, 3, 4, 5], with default setting (cuts=0)
output1

@lee1043 lee1043 linked an issue Oct 21, 2025 that may be closed by this pull request
@lee1043 lee1043 marked this pull request as ready for review October 21, 2025 22:09
@lee1043 lee1043 self-assigned this Oct 21, 2025
@lee1043 lee1043 added the bug label Oct 21, 2025
@lee1043 lee1043 added this to the 4.0.2 milestone Oct 21, 2025
@lee1043 lee1043 merged commit 1d2d595 into main Oct 21, 2025
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Parallel coordinate plot: violin distribution issue

2 participants